JAMA Pediatrics
● American Medical Association (AMA)
Preprints posted in the last 30 days, ranked by how well they match JAMA Pediatrics's content profile, based on 10 papers previously published here. The average preprint has a 0.01% match score for this journal, so anything above that is already an above-average fit.
Scanlon, I.; Rawlings, A.; Tucker, D.; Thayer, D. S.; Evans, H. T.; Given, J.; Jones, S.; Loane, M.; Morgan, C.; Morris, J. K.; Jordan, S.
Show abstract
Background Education outcomes predict life chances. However, poverty, ill-health and disability are barriers to achievement. We examined determinants of academic attainment of children with and without major congenital anomalies in state-funded mainstream schools at ages 11 and 16 (key stages [KS] 2 and 4). Methods and Findings Routinely collected electronic records for children born in Wales 01/01/1998-31/12/2007 until 31/12/2019 were linked in the Secure Anonymised Information Linkage (SAIL) Databank. Education outcomes were explored using logistic regression, adjusting for: anomalies, maternal and child deprivation, prescribing, hospitalisation, gestation length, childs sex, and special education needs (SEN) provision. Children with anomalies were less likely to achieve academic standards: however, attainment was more closely associated with affluence. At age 11, 81.87% (7167/8754) with and 93.80% (232,450/247,814) without anomalies passed (odds ratio [OR] 0.30, 95% confidence intervals [CI] 0.28-0.32). At age 16, 46.76% (2070/4427) with and 56.10% (69,732/124,300) without anomalies achieved 5 General Certificates of Secondary Education (GCSEs) at grades C-A* including English/Welsh, Maths and Science (EWMS) (OR 0.69, 0.65-0.73). Discrepancies narrowed in adjusted analyses, particularly when SEN provision was accounted: aOR 0.72 (0.66-0.78) at KS2, and aOR 0.93, (0.87-1.00) for 5 GCSEs C-A* with EWMS. These GCSEs were achieved by 29.65% (307/1034) children with anomalies and 38.42% (10,875/28,305) of unaffected children in the most deprived quintile{dagger}: in the most affluent quintile, figures were 67.57% (547/810) and 74.98% (16,978/22,644). Children with anomalies, receiving maximum SEN support, eligible for Free School Meals (FSM) were the least successful: 5/192 (2.6%) passed 5 GCSEs C-A* with EWMS, as did 37/354 (10.4%) ineligible for FSM. The strongest associations with these GCSEs were SEN statements (aOR 0.07, 0.06-0.07), FSM eligibility (aOR 0.39, 0.37-0.41), and epilepsy (aOR 0.60, 0.45-0.80). However, data were unavailable for 15-18% of children, mainly those educated outside mainstream schools, and some co-morbidities. Generalisation of findings to other countries rests with readers. Conclusions Many children with anomalies from affluent households succeeded. The children left behind lived with poverty and ill-health from congenital anomalies and/or epilepsy. SEN provision mitigated the impact of disadvantage, but poor children with anomalies were unlikely to succeed. {dagger}taking maternal Welsh Index of Multiple Deprivation (WIMD) 2014 at birth.
Leuenberger, L. M.; Shoman, Y.; Romero, F.; Deligianni, X.; Hartung, A.; Mozun, R.; Goebel, N.; Bielicki, J. A.; Burckhardt, M.-A.; Latzin, P.; Saner, C.; Posfay-Barbe, K. M.; Schwitzgebel, V.; Giannoni, E.; Hauschild, M.; Stocker, M.; Righini-Grunder, F.; Lauener, R.; Mueller, P.; Schlapbach, L. J.; Jenni, O. G.; Spycher, B. D.; Kuehni, C. E.; Belle, F. N.; for the SwissPedHealth Consortium,
Show abstract
OBJECTIVE: Anthropometric data are critical in paediatric care, routinely assessed during clinical visits, and available in electronic health records (EHRs). We describe the feasibility of extracting anthropometric data from heterogeneous EHR systems of Swiss childrens hospitals, evaluate their availability and quality, and assess the cohorts representativeness of the general population. METHODS: In this multicentre study (SwissPedGrowth), we retrospectively collected EHRs from patients <20 years who visited hospitals in Basel, Bern, Geneva, Lausanne, Luzern, St. Gallen, or Zurich between 2017-2023. Sociodemographic, administrative, and clinical information from EHRs were provided in a standardized way by a paediatric national data stream (SwissPedHealth), including the Swiss Neighbourhood Index of Socioeconomic Position (Swiss-SEP). We counted anthropometric recordings per visit to describe availability and used a self-developed and an existing (growthcleanr) algorithm to investigate data quality. To assess representativeness, we compared sociodemographic characteristics between SwissPedGrowth and the general paediatric population in Switzerland, computed standardized differences (effect size: 0.2 small, 0.5 medium, 0.8 large), and weighted the study population to reduce differences. RESULTS: We included 477,531 patients and 2,171,633 hospital visits; 54% boys, 71% Swiss, mean Swiss-SEP 65 (SD: 11), and median age at visit 6.3 [IQR: 2.3, 11.8] years. Height recordings were available for 20% of the visits, weights for 43%, and head circumferences for 5%, with better availability for inpatient stays than outpatient or emergency visits. Combining the self-developed and existing algorithm, 4% of heights and 3% of weights were flagged as outliers and 29% of heights and 31% of weights as carried forward from previous visits or same day duplicates. Sociodemographic differences between SwissPedGrowth and the general population were small or small-to-medium and disappeared after weighting. CONCLUSION: SwissPedGrowth demonstrates feasibility of extracting high-quality anthropometric data for paediatric growth research, but challenges regarding completeness and harmonization of EHR data across Swiss hospitals remain.
Bannett, Y.; Pillai, M.; Huang, T.; Luo, I.; Gunturkun, F.; Hernandez-Boussard, T.
Show abstract
ImportanceGuideline-concordant care for young children with attention-deficit/hyperactivity disorder (ADHD) includes recommending parent training in behavior management (PTBM) as first-line treatment. However, assessing guideline adherence through manual chart review is time-consuming and costly, limiting scalable and timely quality-of-care measurement. ObjectiveTo evaluate the accuracy and explainability of large language models (LLMs) in identifying PTBM recommendations in pediatric electronic health record (EHR) notes as a scalable alternative to manual chart review. Design, Setting, and ParticipantsThis retrospective cohort study was conducted in a community-based pediatric healthcare network in California consisting of 27 primary care clinics. The study cohort included children aged 4-6 years with [≥] 2 primary care visits between 2020-2024 and ICD-10 diagnoses of ADHD or ADHD symptoms (n=542 patients). Clinical notes from the first ADHD-related visit were included. A stratified subset of 122 notes, including all cases with model disagreement, was manually annotated to assess model performance in identifying PTBM recommendations and rank model explanations. ExposuresAssessment and plan sections of clinical notes were analyzed using three generative large language models (Claude-3.5, GPT-4o, and LLaMA-3.3-70B) to identify the presence of PTBM recommendations and generate explanatory rationales and documentation evidence. Main Outcomes and MeasuresModel performance in identifying PTBM recommendations (measured by sensitivity, positive predictive value (PPV), and F1-score) and qualitative explainability ratings of model-generated rationales (based on the QUEST framework). ResultsAll three models demonstrated high performance compared to expert chart review. Claude-3.5 showed balanced performance (sensitivity=0.89, PPV=0.95, and F1-score=0.92) and ranked highest in explainability. LLaMA3.3-70B achieved sensitivity=0.91, PPV=0.89, and F1-score=0.90, ranking second for explainability. GPT-4o had the highest PPV [0.97] but lowest sensitivity [0.82], with an F1-score of 0.89 and the lowest explainability ranking. Based on classifications from the best-performing model, Claude-3.5, 26.4% (143/542) of patients had documented PTBM recommendations at their first ADHD-related visit. Conclusions and RelevanceLLMs can accurately extract guideline-concordant clinician recommendations for non-pharmacological ADHD treatment from unstructured clinical notes while providing clear explanations and supporting evidence. Evaluating model explainability as part of LLM implementation for medical chart review tasks can promote transparent and scalable solutions for quality-of-care measurement.
Georgiades, K.; Chen, Y.-J.; Johnson, D.; Miller, R.; Wang, L.; Sim, A.; Nolan, E.; Dryburgh, N.; Edwards, J.; O'byrne, S.; Repchuck, R.; Cost, K. T.; Duncan, L.; Golberg, M.; Duku, E.; Szatmari, P.; Georgiades, S.; MacMillan, H. L.; Waddell, C.
Show abstract
Background Although an expansive body of evidence exists on children's mental health during the COVID-19 pandemic, it is largely restricted to the early phases and lockdowns. This study examines longitudinal changes in child and youth mental health symptoms across two years of the COVID-19 pandemic, with data collection strategically timed to capture variability in pandemic restrictions. Methods A population-based longitudinal study of 1,261 children and youth aged 4-17 years followed prospectively from January 2021 to December 2022, with five waves of data collected in Ontario, Canada. Latent growth curve modelling was used to estimate trajectories of parent-reported mental health symptoms and identify baseline and time-varying covariates associated with variable trajectories. Findings Mental health symptoms were elevated and stable during lockdowns, followed by significant reductions as pandemic restrictions loosened, particularly for oppositional defiant and inattention/hyperactivity symptoms compared to internalizing symptoms. Children without pre-existing clinician diagnosed physical, mental or neurodevelopmental conditions and those not in lockdown at baseline demonstrated relative increases in mental health symptoms during lockdowns; and girls, compared to boys, demonstrated smaller reductions in internalizing symptoms as restrictions loosened. Concurrent and lagged associations between parental distress and children's mental health symptoms varied across the pandemic. Interpretation Variation in symptom trajectories by mental health domain, gender, pandemic restrictions and pre-existing diagnosed conditions underscores the need for tailored, equity-informed pandemic planning and response. Policies designed to optimize the balance between the need to reduce viral community transmission whilst limiting pandemic lockdowns may mitigate adverse impacts on child and youth mental health. Funding Ontario Ministry of Health
Arildskov, E. S.; Khachadourian, V.; Grove, J.; Schendel, D.; Hansen, S. N.; Janecka, M.
Show abstract
Despite autism's prominent genetic etiology and early-life origins, parsing genetic effects contributing to the condition into those that operate directly (via allelic transmission to offspring) vs. indirectly (via influencing prenatal environment) remains challenging. We examined this using a novel design leveraging 3-generation family linkage in Danish national registers. The cohort included all children born in Denmark from 1998-2015 and their relatives identified through 3-generation family linkage. The analytic sample comprised full maternal cousin pairs, including parallel (children of mother's sister) and cross cousins (children of mother's brother). Exposures were diagnoses in the index mother previously associated with offspring autism; the outcome was autism diagnosis in cousins of the index child. We used Cox proportional hazards models to estimate associations separately in parallel and cross cousins, followed by comparisons of these hazard ratios to infer mechanisms. Several maternal diagnoses (e.g., postpartum hemorrhage, personality disorders, epilepsy) were associated with autism in both parallel and cross cousins, consistent with shared direct genetic effects. Other conditions (e.g., false labor, recurrent major depressive disorder, other anxiety disorders, systemic connective tissue involvement) showed stronger associations in parallel than cross cousins, supporting additional indirect genetic effects operating through the prenatal environment. Adjustment for the same diagnosis in the cousin's own mother did not substantially change estimates, providing no evidence for an additional role of non-genetic mechanisms associated with the diagnosis. These findings suggest that both direct and indirect genetic effects contribute to observed links between maternal health and offspring autism, highlighting etiologic heterogeneity and highlighting a registry-based family design to separate these pathways without genetic data.
Mills, E. A.; Bingham, R.; Nijman, R. G.; Sriskandan, S.
Show abstract
BackgroundAn upsurge in Streptococcus pyogenes infections 2022-2023 highlighted potential benefits of point-of-care tests (POCT) to support clinical pathways, prevent outbreaks, and optimise antibiotic use. ObjectivesWe conducted a pilot research study in a west London paediatric emergency department (ED) to determine whether a molecular POCT had potential to alter management in children who were also having a conventional throat swab taken for culture. MethodsChildren <16 years presenting to ED who had a throat swab requested by a clinician were invited to have a second swab taken for research purposes only. Clinical management was unaffected by the research swab result, which was processed using a molecular POCT that was not approved for use in the host NHS Trust. ResultsPrevalence of streptococcal infection was low during the study (May 2023-June 2025); swab positivity in symptomatic children was 12.8% (6/47). Overall, 38/49 (77.6%) participants who had throat swabs received antibiotics. Of those children recommended to receive antibiotics, 29/38 (76.3%) had a negative POCT. Mean time to reporting of positive throat swab culture results was 3.67 days (range 3-5 days) leading to occasional delay in treatment, although POCT identified positive results within minutes. ConclusionAntibiotic use was frequent and could be avoided or stopped by use of a rule out POCT in over three-quarters of children in the ED, if suspicion of S. pyogenes is the main driver for prescribing. POCT were easy to process and produced immediate results compared with culture, in theory enabling timely decision-making and avoiding treatment delay.
Wang, R. A. H.; Huang, V. S.; Sadiq, S.; Smittenaar, P.; Kemp, H.; Sgaier, S. K.
Show abstract
Introduction Social media is a central part of young people's lives, yet research on its mental health effects remains mixed. We posit that these inconsistencies stem in part from treating youth as a homogeneous group, obscuring distinct behavioral patterns associated with divergent mental health and wellbeing trajectories. Objectives This study aimed to: (1) explore heterogeneity in social media engagement styles among U.S. youth aged 15-24; and (2) examine how these engagement styles are embedded within a broader system of mental health, wellbeing, emotional regulation, belonging, family and neighborhood context, and stress and adversity. Methods Data were drawn from a 2024 nationally representative cross-sectional survey of 2,563 U.S. youth, conducted as part of the Youth Mental Health Tracker initiative. We employed unsupervised clustering to identify five distinct social media engagement profiles. Subsequently, we used Bayesian network-based causal discovery to examine (a) upstream factors that emerge as drivers of engagement styles and (b) downstream outcomes influenced by profile membership in the learned system. Results Five profiles were identified: the Perpetually Plugged-In (31.3%), characterized by near-constant multifaceted social media use, for both positive and negative purposes across multiple domains of life; the Burned-Out Browsers (21.9%), with high exposure to negative and comparison-based content with frequent attempts to disengage; the Practical Navigators (20.7%) who engage in structured, goal-oriented use focused on learning, hobbies, and maintaining connections; the Positive Engagers (13.6%) with high social and identity-driven engagement; and the Light Touch Users (12.5%) who have low overall engagement and limited reliance on social media for connection, identity, or support. Causal analyses revealed that the Perpetually Plugged-In and Burned-Out Browsers had the worst mental health and wellbeing, with their engagement driven by different reasons. While both engagement profiles were influenced by similar psychosocial risk factors, they were distinguished by their dominant drivers: contemporaneous social stressors (bullying, discrimination, and emotional dysregulation) for Perpetually Plugged-In youth, versus adverse childhood experiences for Burned-Out Browsers. In contrast, Positive Engagers reported high social media engagement alongside the highest levels of social wellbeing, using social media for identity exploration and social support within a context of low cumulative stress and adversity. Conclusions Findings suggest that youth social media risk is not driven by intensity of use alone, but by the interaction between engagement style and offline emotional and social conditions. Policies focused solely on restricting access risk overlooking these differences and may inadvertently sever important sources of connection for many youth. Strategies should identify experiential risk signals while strengthening supportive contexts that enable healthier engagement. Overall, youth social media use is best understood as part of a broader psychosocial system, and recognizing this heterogeneity is essential for designing more targeted, equitable, and evidence-based interventions.
Patterson, E.; Rossi, R.; Sallis, H.; Dennie, E.; Howe, L. D.; Emond, A. D.; Herbert, A.
Show abstract
Previous research links Adverse Childhood Experiences (ACEs) with problem gambling, but most studies rely on retrospective reporting and focus narrowly on maltreatment, overlooking adversities such as parental mental health issues. Using data on 3794 young adults in the Avon Longitudinal Study of Parents and Children, we examined longitudinal associations between 10 prospectively measured ACEs (individually and cumulatively), and moderate-risk/problem gambling (Problem Gambling Severity Index >=3) at ages 17, 20 and 24, adjusted for socioeconomic and other background factors. Population attributable fractions (PAFs) estimated proportions of cases potentially attributable to ACEs. Most ACEs were associated with higher odds of moderate-risk/problem gambling across ages (24/30 estimates) after adjustment, though effect sizes were generally small (median adjusted odds ratio [aOR] 1.31, interquartile range 1.24-1.59), and confidence intervals (CIs) wide. Sexual abuse showed the strongest association (aORs 2.4-4.2, CIs 0.5-10.5), while bullying and parental conviction were associated at ages 17 and 20 only, parental separation age 24 only. Evidence for a dose-response relationship was weak. PAFs suggested ACEs accounted for up to 12% of moderate-risk/problem gambling cases. These findings highlight potential impacts of ACEs on later gambling behaviour, but imprecise estimates suggest findings should be interpreted cautiously and strengthened through larger datasets and meta-analyses.
Mannfolk, C.; Ertl, N.; Jayasena, C. N.; Liberg, B.; Wall, M. B.; Comninos, A. N.; Rahm, C.
Show abstract
Mechanistic understanding and biomarkers of gonadotropin-releasing hormone antagonist treatment effect in paedophilic disorder are absent but may enhance outcomes and reduce sexual-offending risk. 52 help-seeking self-referred Swedish men with paedophilic disorder enrolled in a double-blinded, placebo-controlled, randomized clinical trial. Participants underwent task-based fMRI before, and two weeks after, subcutaneous injection of 120mg of degarelix or equal volume of placebo. fMRI blood-oxygen-level-dependent activation was compared between child and adult (child>adult) stimuli in task-derived regions of interest. Primary outcome was within region-of-interest child>adult activation change, whereas secondary outcomes correlated region-of-interest child>adult activation change to change in clinical measurements of risk, paedophilic interest, sexual preoccupation, hyper- and hyposexuality. 19 degarelix and 22 placebo participants had sufficient fMRI data quality. Reductions in paedophilic interest were strongly correlated with increased child>adult cerebellar (vermis) region-of-interest activation following degarelix (r=-0.740, p<0.001) but not placebo (r=0.183, p=0.41; between-group correlation coefficient z=3.347, p<0.001). Treatment did not significantly change child>adult region-of-interest activity. Post hoc analysis indicated that baseline autism symptoms correlated with degarelix-induced changes in paedophilic interest (r=0.717, p<0.001; between-group correlation coefficient z=2.958, p=0.003) and cerebellar activation (r=-0.581, p=0.01; between-group correlation coefficient z=-1.930, p=0.05). Increased child>adult cerebellar activation was associated with degarelix-induced reductions of paedophilic interest, suggesting cerebellar activity as mechanistically important to, and a prospective biomarker of, degarelix treatment effect. Additionally, autism symptoms may inform treatment prediction. Together, these findings have mechanistic and clinical implications for degarelix treatment of paedophilic disorder. EU clinical trials register identifier: 2014-000647-32 https://www.clinicaltrialsregister.eu/ctr-search/trial/2014-000647-32/SE, registered on 05/06/2014.
Smout, S.; Jung, S.; Bergink, V.; Mahjani, B.
Show abstract
Objective: Autistic individuals may face elevated risk for PTSD, yet the degree to which this risk differs by sex remains unknown. We examined the association between autism and incident PTSD, characterized sex differences in risk, identified high-risk subgroups, and described post-diagnosis clinical trajectories. Method: We conducted a population-based matched cohort study using Swedish national registers. Individuals born 1990 through 2010 were followed from age 6 years through December 31, 2017. Autistic individuals (N=42,862) were matched 1:10 to controls (N=412,251) on sex and birth year. Cox proportional hazards regression estimated hazard ratios (HRs) for incident PTSD. Among those who developed PTSD, we compared care utilization, hospitalization rates, and persistence of care contacts. Results: During mean follow-up of 5.1 years, 401 autistic individuals (0.9%) and 903 controls (0.2%) developed PTSD (incidence rates: 18.3 vs 4.2 per 10,000 person-years). Autism was associated with 4.4-fold increased PTSD risk (HR=4.37; 95% CI, 3.93-4.86). Risk was higher among females (HR=4.79) than males (HR=3.39; P interaction=.006). Among autistic individuals, comorbid ADHD conferred additional risk (HR=1.38; 95% CI, 1.14-1.68). Ten-year cumulative incidence reached 6.0% among autistic females with ADHD. Autistic individuals with PTSD had higher care utilization (mean visits: 5.0 vs 3.9; P<.001), more psychiatric hospitalizations (27.9% vs 19.8%; P=.002), and more persistent courses (24.8% vs 12.3% with contacts in all 3 post-diagnosis years; P=.001). Conclusion: Autism is associated with substantially elevated PTSD risk, particularly among females with comorbid ADHD. When PTSD occurs, autistic individuals experience more severe and persistent clinical courses, supporting targeted screening and sustained follow-up.
Dai, H.-J.; Fang, L.-C.; Mir, T. H.; Chen, C.-T.; Feng, H.-H.; Lai, J.-R.; Hsu, H.-C.; Nandy, P.; Panchal, O.; Liao, W.-H.; Tien, Y.-Z.; Chen, P.-Z.; Lin, Y.-R.; Jonnagaddala, J.
Show abstract
Objectives Publicly available datasets dedicated to clinical speech deidentification tasks remain scarce due to privacy constraints and the complexity of speech-level annotation. To address this gap, we compiled the SREDH-AICup sensitive health information (SHI) speech corpus, a time-aligned clinical speech dataset annotated across 38 SHI categories. Methods Two publicly available English medical-domain datasets were adapted to support speech-level de-identification, including script reformulation and controlled re-recorded by 25 participants. Additional Mandarin Chinese clinical-style materials were incorporated to extend linguistic coverage. All audio data were annotated with million-level, time-aligned SHI spans using Label Studio. Inter-annotator agreement was evaluated using Cohen's kappa, following iterative calibration rounds. The resulting corpus supports both automatic speech recognition (ASR) and speech-level recognition of SHIs. Results The final dataset comprises 20 hours of annotated audio, divided into training (10 hours, 1,539 files), validation (5 hours, 775 files), and test (5 hours, 710 files) subsets, totalling 7,830 SHI entities. The language distribution reflects the composition of the selected source materials, with 19.36 hours of English and 0.89 hours of Mandarin Chinese speech. Discussion The corpus exhibits a long-tail distribution consistent with clinical documentation patterns and highlights the limited availability of Chinese medical speech resources. These characteristics underscore both the realism of the dataset and structural challenges associated with multilingual speech de-identification. Conclusion The SREDH-AICup SHI speech corpus provides a clinically grounded, time-aligned speech dataset supporting automated medical speech de-identification research and facilitating future development of multilingual speech-based privacy protection systems.
Geertjens, L. L. M. G.; Cristian, G.; Ramautar, J. J. R.; Haverman, L.; Schalet, B. B. D.; Linkenkaer-Hansen, K.; van der Wilt, G.-J.; Sprengers, J. J. J.; Bruining, H.
Show abstract
Progress in pharmacological treatment development for neurodevelopmental disorders is hindered by a misalignment between targeted mechanisms, outcome measures, and trial designs. This study was initiated as a post-trial access pathway for bumetanide and later expanded with treatment-naive participants. Within this framework, we implemented a parent-cocreated sensory outcome measure set (PROMset) in an unmasked, multiple-baseline single-case experimental design with randomized baseline periods of 2-12 weeks, followed by 6 months of bumetanide treatment (up to 1.5 mg twice daily). Participants (7-19 years) had atypical sensory reactivity and a diagnosis of ASD, ADHD, epilepsy, or TSC. The primary outcome was a PROMset comprising seven PROMIS item banks assessing anxiety, depressive symptoms, sleep disturbance, fatigue, sleep-related impairment, cognitive function, and peer relationships. Secondary outcomes included SSP, SRS-2, RBS-R, and ABC. Of 113 enrolled participants (mean age 13.2 [SD 2.7], 64% male), 102 completed the trial and 95 had analyzable PROMsets. At baseline, PROMset scores showed substantial impairment across domains (mean deviation =9.0 T-score points, p<.001) and correlated with sensory reactivity (SSP; r=-0.40, p<.001). Individual-level analyses showed improvement in 24-41% of participants per PROM domain, most frequently in anxiety and depressive symptoms (41% and 38%; mean across-case Cohen's d=-1). Overall, 83% improved on at least one domain. Group-level analyses showed improvement across all secondary outcomes (p<.001), with superiority over historic placebo for RBS-R and SSP. Integrating PROMsets with individualized trial designs can reveal clinically meaningful changes, supporting a more sensitive and patient-centered framework for treatment evaluation in heterogeneous populations.
Trivedi, S.; Simons, N. W.; Tyagi, A.; Ramaswamy, A.; Nadkarni, G. N.; Charney, A. W.
Show abstract
Background: Large language models (LLMs) are increasingly used in mental health contexts, yet their detection of suicidal ideation is inconsistent, raising patient safety concerns. Objective: To evaluate whether an independent safety monitoring system improves detection of suicide risk compared with native LLM safeguards. Methods: We conducted a cross-sectional evaluation using 224 paired suicide-related clinical vignettes presented in a single-turn format under two conditions (with and without structured clinical information). Native LLM safeguard responses were compared with an independent supervisory safety architecture with asynchronous monitoring. The primary outcome was detection of suicide risk requiring intervention. Results: The supervisory system detected suicide risk in 205 of 224 evaluations (91.5%) versus 41 of 224 (18.3%) for native LLM safeguards. Among 168 discordant evaluations, 166 favored the supervisory system and 2 favored the LLM (matched odds ratio {approx}83.0). Both systems detected risk in 39 evaluations, and neither in 17. Detection was highest in scenarios with explicit suicidal ideation and lower in more ambiguous presentations. Conclusions: Native LLM safeguards frequently failed to detect suicide risk in this structured evaluation. An independent monitoring approach substantially improved detection, supporting the role of external safety systems in high-risk mental health applications of LLMs.
Varela-Mato, V.; Ngina, D.; Orton, E.; Barnes, J.
Show abstract
Background Parenting practices shape children's emotional, social, and cognitive developmental wellbeing. Yet, many families face complex challenges that increase the risk of poor outcomes and demand on social care. The Sutton Parenting Offer (SPO) is a universal, peer-led parenting offer that provides early, non-stigmatising support to families with children aged 0-25 years. It combines evidence-based programmes with informal workshops and peer networks delivered through Family Hubs. This present study is an evaluation protocol of the parenting offer. Aim This evaluation aims to explore how, why, and in what contexts SPO supports families in engaging, sustaining positive change, and generating wider system value. Methods A mixed-methods realist evaluation approach will be used to evaluate SPO across four work packages: engagement pathways, early changes and peer-led ecosystems, long-term change, and system value for money. Data sources will include attendance data (anonymised service records), survey data (entry and exit), and qualitative data (dyad interviews, story circles, and stakeholder-value mapping workshops). The COM-B and the Theoretical Domains Framework (TDF) will guide the analysis of behavioural data. Quantitative data will be analysed descriptively and using paired parametric and non-parametric tests, while qualitative data will be analysed thematically following a realist-informed approach to refine context-mechanism-outcome (CMO) configurations. Discussion This protocol presents the first realist informed evaluation of a universal parenting program in a local authority setting. The evaluation will generate evidence on how, when and why a universal, community-based, and peer-led model such as the Sutton Parenting Offer engages families and generates change. The findings will be useful to inform future parenting service design and implementation in local contexts in England.
Donegan, M. L.; Srivastava, A.; Peake, E.; Swirbul, M.; Ungashe, A.; Rodio, M. J.; Tal, N.; Margolin, G.; Benders-Hadi, N.; Padmanabhan, A.
Show abstract
The goal of this work was to leverage a large corpus of text based psychotherapy data to create novel machine learning algorithms that can identify suicide risk in asynchronous text therapy. Advances in the field of natural language processing and machine learning have allowed us to include novel data sources as well as use encoding models that can represent context. Our models utilize advanced natural language processing techniques, including fine-tuned transformer models like RoBERTa, to classify risk. Subsequent model versions incorporated non-text data such as demographic features and census-derived social determinants of health to improve equitable and culturally responsive risk assessment, as well as multiclass models that can identify tiered levels of risk. All new models demonstrated significant improvements over our previous model. Our final version, a multiclass model, provides a tiered system that classifies risk as "no risk," "moderate," or "severe" (weighted F1 of 0.85). This tiered approach enhances clinical utility by allowing providers to quickly prioritize the most urgent cases, ensuring a more accurate and timely intervention for clients in need.
Arildskov, E. S.; Ahlqvist, V. H.; Khachadourian, V.; Asgel, Z.; Schendel, D.; Hansen, S. N.; Grove, J.; Janecka, M.
Show abstract
The etiology of autism is influenced by genetic and non-genetic factors, with observational studies suggesting associations between early maternal health diagnoses and offspring autism. However, these associations may partly reflect shared familial genetic liability rather than direct causal effects. Using comprehensive national health registers and individual-level genetic data from the iPSYCH cohort (N=117,542), we examined whether maternal health diagnoses are associated with offspring polygenic scores (PGS) for autism. Such associations between maternal health and offspring autism would indicate shared genetic factors and the possibility of genetic confounding in the observational associations. We also tested such associations with PGSs for other neuropsychiatric and neurodevelopmental conditions that are genetically correlated with autism, but with better-powered PGS (due to larger GWAS sample sizes and likely more polygenic genetic architecture), as well as height, a negative control. Several maternal diagnoses were nominally associated with autism PGS in the child, including, e.g., certain obstetric complications, asthma, and obesity. After adjustment for multiple testing, the only statistically significant results included those between maternal diagnoses, predominantly psychiatric, and other neuropsychiatric and neurodevelopmental PGSs in the child. Sensitivity analyses confirmed the robustness of our results across exposure windows, diagnostic settings, and socioeconomic adjustments. These findings indicate that maternal diagnoses associated with autism partially reflect shared genetic liabilities between mothers and their children. However, such genetic effects, as captured by child PGS do not fully explain the observed associations, suggesting additional factors, including e.g., non-genetic familial factors, rare variants, and indirect effects.
Hassani, A.; Pecar, K.; Soliman, M.; Bunyon, P.; Ellinger, C.; Tulysewskid, G.; Croft, J.; Carillo, C.; Wewegama, G.; du Plessis-Schneider, S.; Estevez, J. J.
Show abstract
Background Individuals experiencing or at risk of homelessness face substantial barriers to preventive eye care that are poorly addressed by standard service models. Interdisciplinary optometry-social work collaboration offers a rights-based approach to improving engagement and continuity of care. Methods A convergent mixed-methods study was conducted between February and August 2024 at a multidisciplinary community centre. Clients experiencing or at risk of homelessness received integrated optometry and social work assessment and were prioritised as high, medium, or low based on combined clinical and social risk. Social work follow-up was guided by the Triple Mandate and W-Questions framework. Quantitative data were summarised using mean (SD), median [IQR], or n (%). Qualitative case notes were analysed using content analysis with inductive coding and secondary review for consistency. Results A total of 165 clients had priority categories coded (high: 68; medium: 47; low: 154). Demographic data were available for 132 clients (60% male; mean age 49.5 years [SD 16]); 27% had not completed high school, 89% reported weekly income below AUD 1000, and 28% had vision impairment. Two hundred forty-five case-note entries were consolidated into 146 unique records. SMS (46%) and phone calls (38%) were the most documented contact methods, although only 21% of calls were answered; missed calls (13%) and disconnected numbers (7%) were common. Multi-modal contact was more frequently documented for higher-priority clients. Appointment assistance was the most recorded facilitator (71%), while rights-based supports, including interpreter and transport assistance, were infrequently documented (<=5%). Qualitative analysis identified unstable communication, reliance on informal supports, and service fragmentation as key influences on recall outcomes. Conclusion This study supports an interdisciplinary, rights-based optometry-social work model to address barriers to preventive eye care among people experiencing or at risk of homelessness. Embedding structured handovers and tiered recall processes within community-based services may strengthen continuity and accountability for high-priority clients. Future implementation should evaluate outcomes related to equity of reach, service integration, and sustained engagement in care.
Nordstrand, M.; Fajutrao Falk, S.; Johansson, M.; Pestoff, R.; Tammimies, K.
Show abstract
Genetic counselling outcome measures are increasingly adapted for diverse clinical contexts. While the Genetic Counselling Outcome Scale (GCOS-24) is available in Swedish, no autism-specific version has been developed. Therefore, we adapted the Swedish GCOS-24 using the English version of the modified GCOS-24 (mGCSOS-24) to create a Swedish autism-specific mGCOS-24. Thereafter, we evaluated both the Swedish autism mGCOS-24 and the Swedish general GCOS-24 using Rasch analysis to assess their psychometric properties. Both instruments exhibited structural challenges, including multidimensionality, disordered thresholds, local item dependence, and invariance issues. For the Swedish autism mGCOS-24, we were able to identify subscales with acceptable measurement properties. However, applying the same structure to the Swedish general GCOS-24 did not resolve its broader limitations. This study introduces the first Swedish autism-specific mGCOS-24 and represents the first Rasch-based evaluation of any GCOS-24 or mGCOS-24 in Swedish. Our findings highlight important opportunities for measure refinement but also indicate that new or more substantially adapted tools may be needed to capture outcomes of genetic counselling in autistic populations.
Weill, O.; Lucas, N.; Bailey, B.; Marquis, C.; Gravel, J.
Show abstract
Objectives: Acute gastroenteritis is a leading cause of pediatric emergency department (ED) visits. While ondansetron reduces vomiting, intravenous rehydration, and hospital admissions, its efficacy when initiated at triage remains unclear. We aimed to evaluate whether triage nurse-initiated administration of ondansetron in children with suspected gastroenteritis reduces the proportion of patients requiring observation following initial physician assessment. Methods: We conducted a randomized, double-blind, placebo-controlled trial in a tertiary pediatric ED in Canada. Children aged 6 months to 17 years presenting with morae than 3 episodes of vomiting in the preceding 24 hours (including 1 within 2 hours of arrival), were eligible. At triage, we randomized participants to receive liquid ondansetron or a color- and taste-matched placebo. The primary outcome was the proportion of patients requiring observation after the first physician evaluation. Secondary outcomes included post-intervention vomiting, ED length of stay, patient comfort, and 48-hour return visits. The trial was registered at ClinicalTrials.gov (NCT03052361). Results: Recruitment was stopped prematurely due to the COVID-19 pandemic. Ninety-one participants were randomized to ondansetron (n= 44) or placebo (n= 47). Overall, 40 patients (45%) were discharged immediately after the initial physician assessment, with no difference between the ondansetron and placebo groups (44% vs. 45%; absolute difference -1%, 95% CI: -20% to 19%). No significant differences were observed in all secondary outcomes. Conclusion: In this trial, triage nurse-initiated ondansetron administration did not reduce the need for ED observation in children with presumed gastroenteritis. While being underpowered, this study could inform researchers planning larger clinical trials.
Kohli, S.; Schaffer, E. S.; Savino, J.; Thinakaran, A.; Cai, S.; Halpern, D.; Zweifach, J.; Sancimino, C.; Siper, P. M.; Buxbaum, J. D.; Foss-Feig, J.; Kolevzon, A.; Beker, S.
Show abstract
BackgroundPhelan McDermid syndrome (PMS), caused by SHANK3 haploinsufficiency, is a genetic form of autism spectrum disorder (ASD) that provides a genetically defined model for studying ASD-related circuit dysfunction. SHANK3 mutations disrupt synaptic organization and cortical synchrony, leading to attenuated gamma-band auditory steady-state responses (ASSRs). We investigated whether PMS-related electrophysiological signatures could be identified using machine learning and whether similar patterns are present in a subset of individuals with idiopathic ASD (iASD). MethodsEEG recorded during a 40-Hz ASSR paradigm was collected from 123 participants (42 TD aged 2-30, 56 iASD aged 3-31, 25 PMS aged 2-26). We extracted time-series, ERSP, FOOOF-derived spectral, and intertrial phase coherence (ITPC) features. XGBoost models with leave-one-out cross-validation classified PMS versus TD; the best age/sex-adjusted ITPC model was then applied to iASD participants to derive a Synchrony Atypicality Index (SAI). Unsupervised clustering of high-dimensional ITPC features was also performed. ResultsITPC-based models showed the strongest discrimination between TD and PMS participants (AUROC = 0.83). When applied to iASD participants, 35.7% exhibited elevated SAI, indicating a PMS-like gamma-band phase-locking profile. Classification of iASD versus PMS performed poorly in the full sample but improved markedly after excluding high-SAI iASD individuals, consistent with substantial heterogeneity within iASD. Unsupervised clustering of ITPC features identified PMS-enriched clusters that also captured high-SAI iASD participants. Results were consistent after controlling for age in sensitivity analyses. ConclusionsReduced 40-Hz ITPC is a mechanistically interpretable electrophysiological signature of PMS and identifies a biologically meaningful PMS-like subgroup within iASD, supporting biomarker-guided stratification.